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© Amplification of long nucleic acid sequences by PCR. 

© Methods and reagents are provided for the amplification of nucleic acid sequences, e.g. DNA sequences, 
longer than 10 kilobases by the polymerase chain reaction (PCR). The methods use compositions consisting of a 
primary thermostable DNA polymerase from Thermus thermophilus combined with a lesser amount of a 
secondary thermostable DNA polymerase possessing a 3^0-5' exonuclease activity from Thermococcus 
litoralis, Pyrococcus species GB-D or Thermotoga maritima. The DNA polymerase compositions, when used 
with the disclosed reaction buffer, enable amplifications of DNA sequences up to at least 42.2 kilobases in 
length. 
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The present invention relates generally to the fields of molecular biology and nucleic acid chemistry. 
More specifically, it relates to methods for the amplification of long nucleic acid sequences by the 
polymerase chain reaction. 

The polymerase chain reaction (PCR), a powerful tool for the amplification of nucleic acid sequences, is 
5 disclosed in U.S. Patent Nos. 4,683,202; 4,683,195; 4,800,159; and 4,965,188. In its simplest form, PCR is 
an in vitro method for the enzymatic synthesis of specific DNA sequences, using two oligonucleotide 
primers that hybridize to complementary strands and flank the region of interest in the target DNA. A 
repetitive series of reaction steps involving template denaturation, primer annealing, and the' extension of 
the annealed primers by a DNA polymerase results in the geometric accumulation of a specific fragment 
10 whose termini are defined by the 5* ends of the primers. PCR is capable of producing a selective 
enrichment of a specific DNA sequence by a factor of 10 9 . The PCR method is also described in Saiki et 
al., 1985, Science 230:1350-1354. 

PCR has been widely applied in molecular biology, molecular evolution, medical genetics, population 
genetics, forensic biology, and genome mapping and sequencing projects. However, current PCR are 
75 limited in the size of the region of DNA that can be amplified reliably. - ^ 

Attempts to overcome the length limitations of PCR are reported in Glukhov et al., 1991, Molek. Biol. 
25:1602-1610; Kainz et al., 1992, Anal. Biochem. 202:46-49; Ohler and Rose, 1992, PCR Meth. Applic. 2:51- 
59; Ponce and Micol, 1992, Nucl. Acids Res. 20:623; and Rychlik et al., 1990, Nucl. Acids Res. 18:6409- 
6412. Although amplifications of .5-15 kb sequences were achieved, the reported yields of the longer 
20 ■ products were low. ■ 

PCR methods capable of amplifying long nucleic acid sequences would facilitate genomic mapping and - 
sequencing as well as molecular cloning through the amplification of long, low-copy insert material, and by 
making possible the assembly of larger recombinant constructions in PCR-based mutagenesis. There 
remains a need for methods that will enable PCR amplification of targets of at least 25 kb with high yields. 
25 ' , The present invention provides improved methods and reagents for the PCR amplification of long DNA 
• targets. ; - 

One aspect of the invention relates to combinations of thermostable DNA polymerases which are useful 
in the methods of the present invention. The combinations consist primarily of Thermus thermophilus DNA 
polymerase, a highly active thermostable DNA polymerase that does not exhibit S'-to-S' exonuclease 
30 activity, and secondarily of either Thermococcus lit oralis, Pyrococcus species GB-D, or Thermotoga 
maritima DNA polymerase, all thermostable DNA polymerases that exhibit S'-to-S* exonuclease activity. . 

Another aspect of the invention relates to a buffer useful for carrying out the amplification of long 
targets. 

Another aspect of the present invention relates to PCR amplifications using the specific combinations of 
35 thermostable enzymes described above. The reaction conditions are specified so as to enable the 
amplification of nucleic acid target sequences of up to 42 kilobases in length. 

Another aspect of the invention relates to kits comprising reagents useful in carrying out the, methods of 
the present invention. Such kits comprise a combination of thermostable DNA polymerases as described 
above and, optionally, additional amplification reagents which are useful in the methods of the present 
40 invention. 

To aid in understanding the invention, several terms are defined below, 
r The term "amplification reaction mixture", as used herein, refers to an aqueous solution comprising the 
J various amplification reagents used to amplify a target nucleic acid. The reagents include primers, 
enzymes, aqueous buffers, salts, target nucleic acid, and deoxynucleoside triphosphates (both conventional 
45 and unconventional). Depending on the context, the mixture can be either a complete or incomplete reaction 
mixture. 

The terms "nucleic acid" and "oligonucleotide", as used herein, refer to primers, probes, and oligomer 
fragments to be detected, and shall be generic to polydeoxyribonucleotides (containing 2-deoxy-D-ribose), 
to polyribonucleotides (containing D-ribose), and to any other type of polynucleotide which is an N- 

50 glycoside of a purine or pyrimidine base, or modified purine or pyrimidine bases. There is no intended 
distinction in length between the term "nucleic acid" and "oligonucleotide", and these terms will be used 
interchangeably. These terms refer only to the primary structure of the molecule. Thus, these terms include 
double- and single-stranded DNA, as well as double- and single-stranded RNA. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that the 5* phosphate 

55 of one mononucleotide pentose ring is attached to the 3 f oxygen of its neighbor in one direction via a 
phosphodiester linkage, an end of an oligonucleotide is referred to as the "5* end" if its 5' phosphate is not 
linked to the 3' oxygen of a mononucleotide pentose ring and as the "3* end" if its 3* oxygen is not linked 
to a 5* phosphate of a subsequent mononucleotide pentose ring. 
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The exact size of an oligonucleotide depends on many factors and the ultimate function or use of the 
oligonucleotide. Oligonucleotides can be prepared by any suitable method, including, for example, cloning 
and restriction of appropriate sequences and direct chemical synthesis by a method such as the 
phosphotriester method of Narang et al., 1979, Meth. Enzymol. 68:90-99; the phosphodiester method of 

s Brown et al., 1979, Meth. Enzymol. 68:109-151; the diethylphosphoramidile method of Beaucage et aL, 
1981, Tetrahedron Lett. 22:1859-1862; and the solid support method of U.S. Patent No. 4,458,066. A review 
of synthesis methods is provided in Goodchild, 1990, Bioconjugate Chemistry 1(3):165-187. 

The term "hybridization", as used herein, refers to the formation of a duplex structure by two single 
stranded nucleic acids due to complementary base pairing. Hybridization can occur between complemen- 

10 tary nucleic acid strands or between nucleic acid strands that contain minor regions of mismatch. Stability 
of a nucleic acid duplex is measured by the melting temperature, or "T m ." The T m is the temperature 
(under defined ionic strength and pH) at which 50% of the base pairs have dissociated. Those skilled in the 
art of nucleic acid technology can determine duplex stability empirically considering a number of variables 
including, for example, the length of the oligonucleotide, base composition and sequence of the 

75 oligonucleotide, ionic strength, and incidence of mismatched base pairs. 

Conditions under which only fully complementary nucleic acid strands will hybridize are referred to as 
"stringent hybridization conditions". Stringent hybridization conditions are well known in the art (see, e.g., 
Sambrook et al., 1985, Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York. Generally, stringent conditions are selected to be about 5°C lower than the T m 

20 for the specific sequence at a defined ionic strength and pH. Typically, stringent conditions will be those in 
which the salt concentration is at least about 0.2 molar at pH 7 and the temperature is at least about 60 ° C. 
Relaxing the stringency of the hybridizing conditions will allow sequence mismatches to be tolerated; the 
degree of mismatch tolerated can be controlled by suitable adjustment of the hybridization conditions. 

Two single-stranded nucleic acids that are complementary except for minor regions of mismatch are 

25 referred to as "substantially complementary". Stable duplexes of substantially complementary sequences 
can be achieved under less stringent hybridization conditions. Those skilled in the art of nucleic acid 
technology can determine duplex stability empirically considering a number of variables including, for 
example, the length and base pair concentration of the oligonucleotides, ionic strength, and incidence of 
mismatched base pairs. 

30 The term "primer", as used herein, refers to an oligonucleotide, whether natural or synthetic, capable of 
acting as a point of initiation of DNA synthesis under conditions in which synthesis of a primer extension 
product complementary to a nucleic acid strand is induced, i.e., in the presence of four different nucleoside 
triphosphates and an agent for polymerization (i.e., DNA polymerase or reverse transcriptase) in an 
appropriate buffer and at a suitable temperature. A primer is preferably a single-stranded oligodeox- 

35 yribonucleotide. The appropriate length of a primer depends on the intended use of the primer but typically 
ranges from 15 to 35 nucleotides. Short primer molecules generally require cooler temperatures to form 
sufficiently stable hybrid complexes with the template. 

A primer need not reflect the exact sequence of the template but must be sufficiently complementary to 
hybridize with a template. Primers can incorporate additional features which allow for the detection or 

40 immobilization of the primer but do not alter the basic property of the primer, that of acting as a point of 
initiation of DNA synthesis. For example, non-complementary sequences can be located at the ends of the 
primer to provide restriction enzyme cleavage sites useful in the cloning of an amplified sequence. 

The terms "upstream" and "downstream", as used herein, refer to the location of the primer binding 
sites along the target sequence. The upstream primer hybridizes to the non-coding strand of the target 

45 sequence, and therefore forms the 5* end of the amplified sequence which is a subsequence of the coding 
strand of the target sequence. Similarly, the downstream primer hybridizes to the coding strand of the 
target sequence, and therefore forms the 5' end of the amplified sequence which is a subsequence of the 
non-coding strand of the target sequence. 

The terms "target sequence" and "target nucleic acid sequence", as used herein, refer to a region of 

so the oligonucleotide which is to be amplified, detected, or both. The target sequence resides between the 
two primer sequences used for amplification. 

The term "thermostable nucleic acid polymerase", as used herein, refers to an enzyme which is 
relatively stable to heat when compared, for example, to nucleotide polymerases from E. coll, and which 
catalyzes the polymerization of nucleoside triphosphates. Generally, the enzyme will initiate synthesis at the 

55 3'-end of the primer annealed to the target sequence, and will proceed in the 5*-direction along the template 
until synthesis terminates. 

The methods of the present invention use specific combinations of a DNA polymerase from Thermus 
thermophilus (Tth) with a DNA polymerase from either Thermotoga maritima (7/na), Pyrococcus species 
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GB-D, or Thermococcus lit oralis (Til). 

The terms "S'-to-S' nuclease activity" and "proofreading activity", as used herein, refer to that activity 
of a template-specific nucleic acid polymerase whereby nucleotides are removed from the 3' end of an 
oligonucleotide in a sequential manner, 
s A unit (U) of polymerase activity is a measure of the amount of enzyme needed to synthesize nucleic 

acid at a given rate. The activity units specified herein are as defined by the respective suppliers of each 
polymerase, as listed below. Because activities may be assayed under different specific conditions, activity 
of one enzyme may not be directly comparable to activity of another enzyme. 

Recombinant DNA polymerases from Thermus thermophl/us (rTth) and Thermatoga maritima - 
w (UlTma) are commercially available from Perkin Elmer, Norwalk, CT. One unit of rTth or UlTma™ DNA 
polymerase is defined by the commercial supplier, Perkin Elmer, as the amount of enzyme that will 
incorporate 10 nmoles of dNTP into acid insoluble material at 74 *C in 30 minutes, as measured in a 10. 
minute incubation in a 50 \i\ reaction consisting of the following: 

200 U.M each dATP, dGTP, dTTP 
15 . 100 uM [«- 32 P]-dCTP (0.05 to 0.1 Ci/mmole) 

activated salmon sperm DNA : 

100 mM KCI 

2.2 mM MgCfe 

25 mM TAPS [tris-(hydroxymethyl)-methyl-aminp-propanesulfonic acid, sodium salt], pH 9.3 at 25 *C . 
20 1 mM beta-mercaptoethanol 

Recombinant DNA polymerases from Thermococcus litoralis (Vent R ®) and Pyrococcus species GB-D 
(Deep Verit R ®) are commercially available from. New England Biolabs, Beverly, MA. One unit of Vent R ® or 
Deep Vent R ®. DNA polymerase is defined by the commercial supplier, New England Biolabs, as the amount 
of enzyme that will incorporate 10 nmoles of dNTP into acid insoluble material at 75 • C in 30 minutes in a 
25 reaction consisting of following: 

200 jllM each dNTP (dATP, dCTP, dGTP, and 3 H-dTTP) 
0.2 mg/ml activated DNA 
10 mM KCI ' ■ ' - 

10 mM (NH 4 )2S04 
30 20 mM Tris-HCI, pH 8.8 at 25 °C 

2 mM MgS04 
0.1% Triton X-100 

Conventional techniques of molecular biology, microbiology and recombinant DNA techniques, which 
are within the skill of the art, are explained fully in the literature. See, e.g., Sambrook, Fritsch and Maniatis, 

35 Molecular Cloning; A Laboratory Manual, Second Edition (1989); Oligonucleotide Synthesis (M.J. Gait, ed., 
1984); Nucleic Acid Hybridization (B.D. Hames & SJ. Higgins, eds., 1984); A Practical Guide to Molecular 
Cloning (B. Perbal, 1984); and a series, Methods in Enzymology (Academic Press, Inc.). 

The present invention provides improved methods and reagents for the PCR amplification of long DNA 
targets. The PCR amplification process for the amplification of short nucleic acid sequences is well known, 

40 in the art and described in U.S. Patent Nos. 4,683,195; 4,683,202; and 4,965,188. Commercial vendors, 
such as Perkin Elmer, Norwalk, CT, market PCR reagents and publish PCR protocols. For ease of 
- understanding the advantages provided by the present invention, a summary of PCR is provided. 

In each cycle of a PCR amplification, a double-stranded target sequence is denatured, primers are 
annealed to each strand of the denatured target, and the primers are extended by the action of a DNA 

45 polymerase. The process is repeated typically between 25 and 40 times. The two primers anneal to 
opposite ends of the target nucleic acid sequence and in orientations such that the extension product of 
each primer is a complementary copy of the target sequence and, when separated from its complement, 
can hybridize to the other primer. Each cycle, if it were 100% efficient, would result in a doubling of the 
number of target sequences present. 

so In order to achieve efficient PCR amplification of long targets, several requirements must be met. First, 
target sequences must be completely denatured. Longer targets are increasingly likely to contain GC-rich 
stretches that are prone to incomplete denaturation because of their relatively high melting temperatures. 
Incomplete strand separation permits rapid renaturation of the target DNA, possibly precluding the 
annealing and extension of PCR primers. Second, extension times must be sufficiently long to allow the 

55 completion of strand synthesis in each PCR cycle. Third, long targets must be protected against 
degradation during amplification. Long targets are more susceptible to degradation and strand breakage 
under PCR conditions. Initial template integrity and subsequent strand survival during PCR are therefore 
important considerations. The methods of the present invention are designed to meet these requirements 
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for long PCR without compromising either polymerase activity or the specificity necessary for single-copy 

gene amplifications from genomic DNA. 

Improving target strand separation, lengthening the extension times, and protecting the template DNA 

from degradation during thermal cycling greatly increase the maximum amplifiable target length, but are 
s insufficient to achieve efficient amplification of targets in the 23-42 kb range. The fidelity of nucleic acid 

synthesis is a limiting factor in achieving amplification of long target molecules. 

The misincorporation of nucleotides during the synthesis of primer extension products limits the length 

of target that can be efficiently amplified. The effect on primer extension of a S'-terminal base that is 

mismatched with the template is described in Huang et al., 1992, Nucl. Acids Res. 20:4567-4573. The 
10 presence of misincorporated nucleotides may result in prematurely terminated strand synthesis, reducing 

the number of template strands for future rounds of amplification, and thus reducing the efficiency of long 

target amplification. Even low levels of nucleotide misincorporation may become critical for sequences 

longer than 10 kb. 

The fidelity of DNA synthesis is improved if a small amount of thermostable 3'-to-5' exonuclease, or 

is "proofreading", activity is present in the reaction in addition to the DNA polymerase activity. The 
proofreading activity apparently improves the yields of long products by removing misincorporated 
nucleotides and permitting complete strand synthesis by the predominant polymerase activity. An important 
aspect of the present invention refers to specific mixtures of thermostable DNA polymerases that greatly 
increase the maximum target length amplifiable by providing both 3Mo-5' exonuclease activity and 

20 polymerase activity. " . 

Proofreading exonuclease activity is not found in Tth DNA. polymerase (Myers and Gelfand, 1991, 
Biochemistry 30:7661-7666), but is inherent in the DNA polymerases from Thermococcus litora/is, 
Pyrococcus species GB-D, and Therm atoga maritima. However, amplification of long targets with Vent R ® 
DNA polymerases alone is less efficient than with Tth DNA polymerase which does not exhibit 3'-to-5' 

25 exonuclease activity. The decreased amplification efficiency is probably due, at least in part, to primer 
degradation and a decrease in net processivity resulting from the competition between, the 3'-to-5 f 
exonuclease and polymerase activities. 

The relative amounts of 3'-to-5 f exonuclease activity and polymerase activity can be controlled by 
mixing DNA polymerases. By combining a small amount of a secondary polymerase which has proofread- 

30 ing activity, such as Tli DNA polymerase, with an active primary polymerase, such as Tth DNA polymerase, 
the advantage of a proofreading activity can be combined with the active DNA polymerase activity inherent 
in the primary polymerase. 

Nearly all aspects of PCR protocols affect the amplification efficiency of long target molecules. 
Extension times, co-solvents, and polymerases (with and without S'-to-S'-exonuclease activity) are the most 

35 critical parameters, but the pH and composition of the reaction buffer, salts (K + and Mg 2+ ), and primer 
design are also important variables for the success of amplifications of long targets. The effects of the 
individual components of a PCR amplification on the amplification efficiency of long targets are discussed 
below; 

40 Temperature Cycling 

The amplification reactions exemplified herein use a two-step temperature cycle in which the reaction 
temperature alternates between a high temperature at which the target nucleic acid is denatured, and a 
lower temperature at which the primers anneal to the denatured target sequences and primer extension 

45 occurs. The time and temperature of each step in each cycle effects the efficiency of amplification. 

More complete target denaturation can be achieved by raising the denaturation temperature. However, 
raising the denaturation temperature may cause higher rates of damage, such as depurination, which 
decreases the amplification efficiency, as well as increases loss of polymerase activity. Although it is 
important to achieve complete denaturation of the target nucleic acid, the rate of target damage must be 

so simultaneously minimized. Consequently, moderate denaturation temperatures (e.g., about 94° C, depend- 
ing on GC content) are preferred, with the completeness of denaturation improved by the addition of co- 
solvents, as described below. 

A relatively high annealing temperature (e.g., about 68 °C) reduces the hybridization of primers to 
partially homologous target sites, thereby minimizing the synthesis of products from secondary priming 

55 sites. In amplifications using lambda DNA target as described in the Examples, a minimum of 5-6 minutes 
at 68 *C is needed. The addition of a more stringent 70-75 *C annealing step does not significantly improve 
yields. Similarly, more complex temperature profiles with temperature spikes to accommodate pot ntially 
problematic GC- or AT-rich stretches are not significantly beneficial. 

5 
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An extension time that permits the completion of strand synthesis is critical for achieving amplification 
of long targets. For the amplification of targets longer than 20 kb, an annealing and extension time of at 
least 12 minutes, but no more than 22 minutes in any cycle, is preferred! Minimum extension times are 
dependent upon other factors, such as co-solvent levels, as discussed below. Amplification reactions in 
5 which the initial extension time used is about 12 minutes and the extension time is increased 15-20 seconds 
per cycle yield less non-specific product formation than reactions in which an extension time of more than 
15 minutes is used throughout the amplification. The autoextension feature of the thermal cycler marketed 
by Perkin Elmer, Norwalk, CT f provides a convenient way to increase the extension times during an 
amplification reaction. 

10 

Reducing Amplification of Non-specific Targets 

Typically, PCR reagents' are combined at room temperature before the initial denatu ration step. The 
low, less stringent temperature can result in the binding of primers either to other primers or to partially- 

75 homologous target sequences. Extension products can be formed from this non-specific primer binding 
which can lead to short products that serve as extremely efficient target competitors, thereby reducing the 
efficiency of amplification of the desired long product A "hot-start" method minimizes the synthesis of 
primer extension product from non-specific primer hybridizations by inhibiting extension reactions until the 
reaction temperature is increased enough to prevent such non-specific binding. Since genomic templates 

20 are likely to contain sequences of partial homology to the target primer sequences, a hot-start protocol is 
important to maximize efficiency of long target amplification. , 

One method of achieving a hot-start involves withholding an essential PCR reaction component until the 
temperature of the amplification mixture has been raised to 75-80 *C. Examples include withholding either 
the UNA polymerase or Mg 2+ , which is an essential catalyst for DNA polymerase activity. In one hot-start 

25 protocol, the essential component is added by hand after the denaturation temperature has been reached. 
Alternatively, the essential reaction component can be withheld by separating reaction components within a 
reaction tube using a heat-labile barrier, such as a wax that melts at the reaction temperatures. This 
minimizes the number of times the reaction tube must be opened, thereby decreasing the possibility of 
contamination. - - 

30 Another hot-start protocol which may be useful in the methods of the present invention utilizes uracil-Nr 
glycosylase to degrade any non-specific product formed before the amplification mixture temperature is 
raised (see PCT Patent Publication No, WO 92/01814). 

PCR Reagents • - . 

35 j , • 

In a PCR, the primer extension reaction occurs when the primer-template mixture is incubated with a 
DNA polymerase under suitable polymerization conditions. These conditions are provided by a reaction 
mixture containing a divalent cation, a monovalent cation, all four deoxyribonucleotide triphosphates 
(dNTPs), and a buffering agent Co-solvents may be added to the reaction mixture which affect the 
40 denaturation conditions. Each of these components affects the efficiency of the extension reaction and is 
discussed separately below. 



DNA Polymerase 

45 The choice of the combination of thermostable DNA polymerases and their concentrations becomes 
particularly important as the target length or sequence complexity is increased. The combination of Tth 
DNA polymerase and 77/ DNA polymerase provides the most efficient amplification of long PCR products, 
and allows amplification of targets over 40 kb in length. 

The optimal amount of DNA polymerase in a PCR amplification depends on a number of factors, 

so including the number of copies of target sequences present in the sample. For high-copy reactions 10 7 
copies of target), higher yields are obtained by using 2-2.5 units (U) Tth DNA polymerase per 50 jxl 
reaction. Further increases in polymerase concentration result in: increase in the amplification of non- 
specific target molecules, resulting in higher background levels when the amplified products are. detected 
by agarose gel electrophoresis. For low-copy reactions 10 4 copies of target), however, specificity is 

55 maximized using about 0.8-1 U Tth DNA polymerase per 50 ul reaction. For intermediate copy numbers of 
target, maximum yields are achieved using intermediate polymerase concentrations. The optimal poly- 
merase concentration is also dependent on the divalent cation concentration. At higher Mg 2+ concentrations, 
polymerase levels were reduced to minimize accumulation of non-specific products. 

6 

BNSDOCID: <EP 0669401 A2_l_> 



V 



EP 0 669 401 A2 

Using PCR with Tth DNA polymerase alone, the maximum target size amplifiable from high-copy phage 
lambda DNA samples was found to be limited to about 23 kb. Similarly, the maximum target size 
amplifiable from low-copy phage lambda DNA samples was found to be limited to about 10-12 kb. Dramatic 
increases in the size of the amplifiable target are achieved by adding a small amount of thermostable 3'-to- 
5 5'-exonuclease. 

As described above, 3'-to-5' exonuclease activity is not found in Tth DNA polymerase. Proofreading 
activity is added by combining the Tth DNA polymerase with a small amount of thermostable DNA 
polymerase that has a proofreading activity, such as the DNA polymerases from Thermococcus litoralis, 
Pyrdcoccus species GB-D, and Thermotoga maritima. Low concentrations of any of these DNA poly- 
10 merases are effective in extending the range of target sizes amplifiable by PCR using either Tth DNA 
polymerase; however, a combination of Tth and ' 77/ DNA polymerases has been found to be the most 
reliable and efficient. 

The optimal concentration ratio is approximately 0.015-0.15 U 77/ DNA polymerase per 2-2.5 U Tth 
DNA polymerase for amplifications from high-copy samples 10 7 copies of target in a 50 ul reaction). For 
75 amplifications from low-copy samples (^ 10 4 copies of target in a 50 ul reaction), the optimal concentration 
ratio is approximately 0.015-0.15 U Til DNA polymerase per 0.8-1 U Tth DNA polymerase. Higher 
concentrations of 77/ DNA polymerase reduce yield, possibly due to primer degradation. 

Co-solvents • , 

20 

A co-solvent, such as glycerol, is a critical reaction component for the efficient amplification of long 
targets. A number of co-solvents have been reported to facilitate PCR, including glycerol, dimethylsulfoxide 
(DMSO). polyethylene glycol, and formamide. One way in which a co-solvent may influence the efficiency 
of long-target amplifications is by increasing the thermal stability of the DNA polymerase. Increasing the 
25 thermal stability slows the loss of DNA polymerase .activity during the repeated high-temperature denatur- 
ation steps. 

Another effect is that a co-solvent may effectively lower the melting and strand separation temperatures, 
thus facilitating the denaturation of the template and increasing the specificity of primer annealing. For 
example the melting temperature can be lowered by 2.5-3 'C by the addition of 10% glycerol. Thus, by the 

30 addition of a co-solvent, an increase in the completeness of target denaturation can be achieved without 
raising the denaturation temperature, which would simultaneously increase the degradation of target 
molecules, as discussed above. 

A standard Tth PCR buffer typically contains 5% (v/v) glycerol. An increase in the amount of glycerol 
added to an amplification reaction can significantly improve the amplification of long target sequences. 

35 Significant increases in the yield of a 9.4 kb target result from supplementing a standard Tth PCR buffer 
with 5% (w/v) glycerol. The percentages described here do not include any glycerol contribution from the 
various enzyme stocks used. 

DMSO, preferably in a concentration of about 5-6% (v/v), may also be used alone. However, 
combinations of glycerol and DMSO are more effective for longer targets. Preferred concentration combina- 

40 tions include 5-14% (w/v) glycerol with 0.5-5% (v/v) DMSO. For example, amplifications of phage lambda 
targets 25-34 kb long were enhanced by the combination of 1-3% (v/v) DMSO with 10% glycerol, or by 
using 5% of both co-solvents; amplifications of phage lambda targets 34-42 kb long were most enhanced 
by the combination of 8-9% glycerol with 5% DMSO. Furthermore, with a combination of 3% DMSO and 
10% glycerol, targets of up to 34 kb were readily amplified with a 10-minute extension time; with a 

45 combination of 1% DMSO and 10% glycerol, amplification was limited to 26 kb targets. A preferred 
combination consists of 10% glycerol and 2.25% DMSO. 

DMSO, unlike glycerol, reduces the thermal stability of the polymerase. However, the effective lowering 
of melting and strand separation temperatures by 5.5-6 °C per 10% DMSO may be the dominant effect in 
long PCR. The addition of DMSO may also increase the DNA stability by decreasing the rates of 

so depurination and/or chain scission and may accelerate strand renaturation. The reduction of melting and 
strand separation temperatures by combinations of glycerol and DMSO is generally consistent with a total 
reduction estimated by adding the effects of each component alone. The enhancement of yields resulting 
from the effective lowering of the melting and strand separation temperatures by the addition of a co- 
solvent, as discussed above, is not readily duplicated by raising the denaturation or annealing temperature 

55 during PCR. 
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Buffers , . 

The pH of an amplification mixture affects the stability of the template DNA. Increasing the pH of the 
reaction can decrease the degradation of template DNA during thermal cycling. Although PCR amplification 
5 mixtures are pH buffered, the pH of a typical PCR reaction varies considerably during the temperature 
cycling because of the temperature dependence of the reaction buffer. The, buffering agent used in a typical 
PCR is Tris, which has a ApKa of -0.031 per * C. The fluctuation in pH during the temperature cycling can 
be decreased by using a buffering agent with a smaller ApKa. 

.' Two suitable buffers are Tris(hydroxymethyl)methylglycine (tricine), which has a ApKa of -0.021 per ° C, 
io and N,N-Bis(hydroxyethyl)glycine (bicine), which has a ApKa of -0.018 per rC; both values measured at 
20 «C and 0.1 M ionic strength (see Good and Izawa, 1972, Meth. Enzymol. 24, Part B:53-68). With either a 
tricine or bicine buffer, the pH remains higher during the high temperature reaction conditions than with the 
typical Tris buffer, and the fluctuations in pH caused from the temperature cycling are decreased. 

Optimal buffers and pH are dependent on, among other things, the DNA polymerase used. Using Tth 
is DNA polymerase, a buffer consisting of 10-35 mM, preferably 20-25 mM, tricine at pH 8.5-8.7 (25 0 C) 
provides the most reliable results. Optimal buffer conditions may need to be determined empirically for the 
amplification of specific targets. 

Divalent Cation 

20 " • 

The preferred divalent cation for the amplification of DNA is Mg 2+ . In the absence, of added 3'-to-5'- 
exonuclease activity, long PCR is enhanced at total Mg2+ levels of 1.7-2 mM. In the presence of 
proofreading activity, however, the highest yields are obtained with 0.9-1.3 mM total Mg 2+ . Increased yields 
of some targets can be achieved by increasing the Mg 2+ concentration up to 1.5 mM while reducing the 

25 total enzyme concentration, particularly the primary polymerase levels (to 1.25-2 U Tth DNA polymerase). 
However, for some targets, reducing total enzyme levels in order to reduce the synthesis of non-specific 
products at higher Mg 2+ levels also reduces product yields. As with K + levels described below, the Mg 2+ 
optimum for each system may need to be determined empirically. 

. 30 Monovalent Cation 

The preferred monovalent cation is K\ supplied as KOAc (K-acetate) or KCI. For the amplification of 
long target molecules, reduced K + levels are beneficial. A decrease in non-specific background can be 
achieved if the K + is supplied as KOAc rather than KCI. In general, K + concentrations reduced by 10-40% 

35 are more favorable to long PCR than the standard levels (100 mM KCI for use with Tth DNA polymerase). 
Preferred concentrations for use with. Tth DNA polymerase are 60-100 mM KOAc, preferably 80-85 mM 
KOAc. Optimal concentration ranges may be system-dependent. 

The efficiency of PCR amplifications using tricine or, bicine buffers is similar using either KCI or KOAc 
as the monovalent cation. However, improved reaction robustness is realized using a tricine/KOAc buffer. A 

40 tricine/KOAc buffer has a slightly lower ionic strength than a tricine/KCI buffer, which could help destabilize 
secondary structures in a template with a high G + C content, thereby improving the completeness of target 
dehaturatibh. 9 >;w k * v ■- ' \-v ^>/, ^ " 

Although KCI and KOAc are the preferred monovalentsalts, other monovalent salts may be useful in the 
methods of the present invention. These include NaCI, (NHO2SO4, K-glutamate, and NH*-acetate. 
45 ■ 
Primers 

Primer concentrations may need to be optimized for each system and approximate' starting template 
copy number. For example, for the phage lambda amplification reactions described in the Examples, below, 

so a higher concentration of primer was optimal for amplifying samples containing a high copy number of 
target than was optimal for amplifying samples containing a low copy number of target. For the high-copy 
reactions (£ 10 7 copies of target), the optimum primer concentration was 0.4-0.5 uM of each primer. For 
low-copy amplifications (^ 10* copies of target),. 0.15-0.2 uM of each primer was most effective in the 
absence of proofreading activity, and 0.2 uM of each primer was best if 3'-to-5'-exonucleolytic activity was 

55 present. For intermediate copy-number reactions, increasing the primer concentration above 0.2 uM was as 
least as effective as increasing DNA polymerase levels, as discussed above, in enhancing yields. The 
improved PCR protocols that enable the amplification of target nucleic acid sequences up to 42 kb in length 
are summarized in Table 1 , below. 
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Table l 

Optimal Long PCR Conditions 

Temperature profile 

25 to 40 amplification cycles (template copy number dependent) 
Two-temperature cycling: 

(a) Short denaturation step, (e.g. 94°C for 10-15 seconds) 
10 (b) Long annealing/extension step, (e.g. 68°C for 10-14 minutes initially, 

increased by 15-20 seconds per cycle for at least 5-8 cycles) 
Final hold at 72°C for at least 10 minutes 

75 Hot-start 

Separate reagent (Mg2+, enzyme, or dNTPs) until all samples have reached 
75-80°C, preferably using a wax barrier. 



20 



25 



30 



35 



40 



Primary polymerase 

2.5 units Tth DNA polymerase per 50 \H for high-copy template (> 107 copies) 
0.&-1.0 units Tth DNA polymerase per 50 |il for low-copy template (< 104 copies) 

S'-to-S'-exonuclease (high- or low-copy template) 

0.015-0.15 units 77/ DNA polymerase per 50 pi 
Co-solvent 

5-14% glycerol with 0.5-5% DMSO 

Buffer 

20-25 mM tricine or bicine, pH 8.5-8.7 

Divalent cation 

0.9-1.5 mM Mg2+ total; 0.2 mM changes can be critical 

Monovalent cation 
80-85 mMKOAc 



Primer design 

Either 20-23 bp with 50-60% GC content, or longer sequences, to permit the use 
of relatively high annealing temperatures. 

45 Primer concentration 

0.4-0.5 jiM for high-copy template (> 107 copies) 
0. 1 5-0.2 |iM for low-copy template (< 104 copies) 

so dNTP concentration 

0.2 mM each dATP, dCTP, dGTP, dTTP 

In general, the nucleic acid in the sample will be DNA, most usually genomic DNA. Howev r, the 
55 present inv ntion can also be practiced with other nucleic acids, such as RNA or cloned DNA, and the 
nucleic acid may be either single-stranded or double-stranded in the sample and still be suitable for 
purposes of the present invention. Those skilled in the art recognize that whatev r the nature of the nucleic 
acid, the nucleic acid can be amplified using appropriate modifications to the present m thods. 
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Due to the enormous amplification possible with the PCR process, small levels of DNA carry-over from 
samples with h.gh DNA levels, from positive control templates, or from previous amplifications can result in 
PCR product, even in the absence of purposefully added template DNA. If possible, all reaction mixes are 
set up in an area separate from PCR product analysis and sample preparation. The use of dedicated or 
disposable vessels, solutions, and pipettes (preferably positive displacement pipettes) for RNA/DNA 
preparation, reaction mixing, and sample analysis will minimize cross contamination.. See also Higuchi and 
Kwok, 1989, Nature 339:237-238 and Kwok, and Orrego, in Innis et al. eds., 1990 PCR Protocols: A Guide 
to Methods and Applications, Academic Press, Inc., San Diego, CA. 

Enzymatic methods to reduce the problem of contamination of a PCR by the amplified nucleic acid 
from previous reactions are described in PCT Patent Publication No. WO 92/01814 and U.S. Patent No 
5,035,996. The methods allow the enzymatic degradation of any amplified DNA from previous reactions 
PCR amplifications are carried out in the presence of dUTP instead of dTTP. The resulting double-stranded 
amplification product which incorporates uracil is subject to degradation by uracil-N-glycosylase (UNG) 
whereas normal thymine-containing DNA is not degraded by UNG. Amplification reaction mixtures are 
treated with UNG before amplification to degrade all uracil containing DNA that could serve as target 
Because the only source of uracil-containing DNA is the amplified product of a previous reaction this 
method effectively eliminates the problem of contamination from previous reactions (carry-over) UNG is 
rendered temporarily inactive by heat, so the denaturation steps in the amplification procedure also serve to 
inactivate the UNG. New amplification products, therefore, though incorporating uracil, are formed in an 
UNG-inactivated environment and are not degraded. 

Analysis of the amplified products may be achieved by a variety of means depending on the 
information desired. The nucleotide sequence of amplified products can be. obtained using standard 
techniques, such as the protocol described by Innis et al., 1988. Proc. Natl. Acad. Sci. 85:9436-9440 The 
PCR amplification products can be sequenced directly (see Saiki et al., 1988, Science 239 487-491) or 

indirectly by first cloning the products and replicating them in an appropriate host cell. 

Amplified nucleic acid sequences can be detected and purified by methods well known in the art (see 
Sambrook, et al., 1989, supra). Methods which separate molecules according to size, such as gel 
electrophoresis, can be used to purify the amplified nucleic acid. In particular, agarose and/or acrylamide 
gel electrophoresis are preferred means for analyzing amplified products (see Scharf et al., 1986 Science 
233:1076-1078). For greater size resolution, either field inversion gel electrophoresis or low-percent (0 3%) 
agarose gel electrophoresis may be used, as described in the Examples. 

Amplified products can be detected by direct visualization of the electrophoretically size fractionated 
product by, for example, staining with ethidium bromide. Alternatively, amplified products can be detected 
using oligonucleotide hybridization probes which are complementary to the target sequence Under 
appropriate hybridization conditions, probes hybridize only to target nucleic acid sequences, The presence 
of hybrid duplexes, which can then be detected by various means, indicates the presence of amplified 
product. To facilitate the detection of hybrid duplexes formed between probes and target nucleic acid 
sequences, either the primers or the probes may be bound to additional molecules, such a detectable label 
or a molecule that enables the immobilization of the primer or probe. Labels incorporated into the probes to 
allow detection or immobilization should not affect the hybridization properties of the probes 

Probes can be labeled by incorporating a label detectable by spectroscopic, photochemical, biochemi- 
cal, immunochemical, or chemical means. Useful labels include 32 P; fluorescent dyes, electron-dense 
reagents, enzymes (as commonly used in ELISAs), biotin, or haptens and proteins for which antisera or 
monoclonal antibodies are available. Probes also can be bound to an additional compounds that are used to 
immobilize the probe on a solid support. 

Labeled probes can be synthesized and labeled using the techniques described above for synthesizing 
oligonucleotides. For example, the probe may be labeled at the 5'-end with *P by incubating the probe with 
P-ATP and kinase. A suitable non-radioactive label for SSO probes is horseradish peroxidase (HRP) 
Methods for preparing and detecting probes containing this label are described in U S Patent Nos 
4,914,210, and 4,962,029. The use of such labeled probes is also described in U.S. Patent No 4 789 630- 
Sa,k. et al., 1988. N. Eng. J. Med. 319:537-541; and Bugawan et al., 1988, Bio/Technology 6:943-947' 
Useful chromogens for the detection of HRP labeled probes include red leuco dye and 3,3\5,5'-tetramethvl- 
benzidine (TMB). 

Examples of additional compounds incorporated into probes to allow immobilization of the probes 
include a long poly-dT "tail" that can be fixed to a nylon support by irradiation, a technique described in 
more detail in PCT Patent Publication No. WO 89/11548. 

Suitable assay methods for detecting hybrids formed between probes and target nucleic acid se- 
quences in a sample are known in the art (Sambrook et al., 1985, supra). Examples include the dot blot and 
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reverse dot blot assay formats. 

In a dot blot format, unlabeled amplified target DNA is immobilized on a solid support, such as a nylon 
membrane. The membrane-target complex is incubated with labeled probe under suitable hybridization 
conditions, unhybridized probe is removed by washing under suitably stringent conditions, and the 
5 membrane is monitored for the presence of bound probe. 

An alternate format is a "reverse" dot blot format, in which the amplified target DNA is labeled and the 
probes are immobilized on a solid support, such as a nylon membrane. The target DNA is typically labeled 
during amplification by the incorporation of labeled primers. The membrane-probe complex is incubated 
with the labeled sample under suitable hybridization conditions, unhybridized sample is removed by 
io washing under suitably stringent conditions, and the filter is then monitored for the presence of bound target 
DNA. 

Alternatively, the reverse dot blot assay may be carried out using a solid support having a plurality of 
probe hybridization sites or wells. For example, a microwell plate is particularly useful in large scale clinical 
applications of the present methods. A reverse dot blot assay utilizing a microwell plate is described by 
is Loeffelholz et al., 1992, in J. Clin. Microbiol. 30 (11): 2847-2851. Probes can be immobilized to a microwell 
plate either by passive binding or by first binding the probes to bovine serum albumin (BSA), which adheres 
to microwell plates. 

Another suitable assay method system is described in U.S. Patent No. 5,210,015, in which a labeled 
probe is added during the PCR amplification process. The probes are modified so as to prevent the probe 

20 from acting as a primer for DNA synthesis. Any probe which hybridizes to target DNA during each 
synthesis step is degraded by the 5'-to-3' exonuclease activity of the DNA polymerase. The degradation 
product from the probe is then detected. Thus, the presence of probe breakdown product indicates that 
hybridization between probe and target DNA occurred. 

The present invention also relates to kits, multicontainer units comprising useful components for 

25 practicing the present method. A kit will contain a combination of preferred polymerase enzymes in the 
concentration ratios described herein. Additional components which may be contained in a useful kit include, 
primers for PCR amplification and reagents for carrying out the PCR methods of the present invention. { 
The ability to amplify sequences of 10-40 kb has a number of applications in areas such as genome 
mapping, sequencing, and genetics. Small gaps in the genome' maps that currently appear resistant to 

30 molecular cloning may be accessible by amplification of a sequence between known flanking sequences. 
The amplification of longer targets would also allow greater flexibility in choosing primers to avoid 
problematic sequences, such as that seen in the beta-globin gene system described below. Longer 
templates promise to speed the process of genomic sequencing as well, by increasing the distance covered 
with each sequencing step. From known expressed sequences, amplifications can be carried out spanning 

35 longer introns, and more complete genes sequences can be amplified at one time. Long PCR therefore 
complements technologies for rapid, long-range sequencing. PCR-based characterization and diagnosis of 
both homozygotes and heterozygote carriers of a number of medically important insertions and deletions of 
greater than 4 kb would also be possible. 

The results presented here specifically demonstrate the potential application of these protocols to the 

40 characterization of cloned sequences. The J and cro gene primers, CF1018 (SEQ ID NO: 23) and CF1019 
(SEQ ID NO: 24), described below should be useful for nearly all inserts cloned with lambda-based vectors, 
for amplifications from both plaques and . isolated DNA. The PCR products are readily analyzed by 
restriction digests and should be suitable for sequencing. Cosmid inserts may also be amplifiable from 
colonies. Long PCR will facilitate molecular cloning by amplifying low-copy insert material, and facilitate 

45 assembly of larger recombinant constructions in PCR-based mutagenesis. 

The examples of the present invention presented below are provided only for illustrative purposes and 
not to limit the scope of the invention. 

Example 1 

50 

Materials and Methods 

Preferred protocols and reagents for the PCR amplification of long phage lambda and human beta- 
globin gene cluster sequences are described below. The r suits of amplifications using the following 
55 methods are described in the subsequent examples. 
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Target Nucleic Acid Sequences 

hP .o TW °,H temPlate nUClei ° aCid Sequences were us * d the design of the amplification primers described 
below, the sequence of the phage .ambda genome (GenBank accession number M17233) and I the 
sequence ^ the human beta-g.obin gene Custer (GenBank accession number J00179). Phage lambda and 
human DNA were used in the amplifications described below 9 
^ Lambda DNA (1 ng/u.l) was obtained from Perkin Elmer, Norwalk, CT. Aliquots (-100 ng) of lambda 
DNA were thawed once, then stored at 4-C. Total genomic DNA from human placenta was ob Jnedfrom 

2 Tct STJSrt Louis ' MO ' A " temp,ate DNA dilutions were made with 10 mM ™" ^ ™ H * at 

A library of human genomic clones in lambda FIX II was obtained from Stratagene, La Jolla CA and 
s 9 o^od aS D ,Lr mmended ^ manufeCturer - on Luria ^oth agar plates with top agarose Random* 

(OH 83 ?0mMM W r? "T?* T 9 ' pipGttes ' and ***** 30 u. of 25 mM TrS 

(pH 8.3), 1 0 mM MgCI 2 and stored at 4 ".C. Aliquots of 1 ul were used for PCR 

• Total genomic DNA from a ■ lymphoblastoid cell line (KAS0T1 B) was isolated using 0 1 . md/ml 
protege K and 0.5% SDS in 10 mM Tris-CI ( P H 8), 150 mM NaCI, and 10 mM EDTA, ove nLt at 50^ 
Followmg extracts with Tris-saturated phenol ( P H 8), and ethanol-precipitation with NaOAc the sampS 
^1 mM EDm A ' ^ 6XtraCted ^ P heno| - chlorofo - ■ ■«"<* dialled against 10 mM T^s-CMpH 

, Primers . 



A set of primers was designed to enable the PCR amplification of lambda genomic target sequences 
25 ofT 9 ; 1 fr ° m 15 t0 422 kNObaSeS ,en9th " UpStream primers were d ^ d to beS SK each 
Ibies ^ PnmerS ' reSU ' tin9 in 3 Seri6S ° f tar96t Sequences * 'ength by 1 to S 

tem D E e a raLeTe 8 °c/^^Lr aS deS ' 9ned S ° * S t0 h3Ve W™™^ the same optimal annealing 
2 ' » I J 9 Pr ' mer sequences betw ^n 20 and 23 base pairs in length such that the 

30 C pailt LnVfiTi A T ^rT™ *** SeqU8nCe W ° U ' d have an overa " composition of l" 2 G- 

' ^^^^Z^^^ temPeratUreS using the "Tp" algorithm 

* n f ditional P*» of primers, the J and cro gene primers, were designed to enable amplification of 
2 beiow ,a,T,bda - baSed vectors - from *her plaques or isolated DNA, is SS!7?l2 

35 Th P S r! mi ' arly ' Prim ! rS W6re deSiQned f ° r the am P |ifi cation of regions of the human beta-globin gene cluster 
The primers were des.gned such that a fixed downstream primer could be used with a series of upsfream 
pnmen. to amphfy targets of 7.5-22 kb. The primers amplify a target region extending u^SSLfaSSTK 
delta-glob.n gene and into the second intron of the A-gamma globin gene am across the 

h.inl^M 0 .?' 60 *^ Sequences of the P rimers used the following examples are shown (5"-to-3') in Table 2 
B^em MoT S'SST^mT eSSM * * described " Wetmur, 1991 Jt Rev.' 

ends ; a s unL n^-ff o?" 9 temp ? rature calculations were carried out assuming 2 dangling 

we^ evaluated h 9yCer °' the Tm by 25 * a Primer nucleotide ■equenSw 

• were evaluated for potential secondary pr.mmg sites within the template DNA sequences and for inter- and 

« mtra-pnmer sequence comp.ementation using the O.igo 4.0 software (National Biosciences PlymouS mn" 
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10 



15 



20 



25 



Primer Seq ID No . 



Table 2 
Amplificati on Primers 
Sequence 



Position 



primers for phage lambda (GenBank accession no. M17233): 



CF1001 


1 


scion 


2 


CF1005 


3 . 


CF1007 


4 


CF1008 


5 


CF1010 


6 


SC1012 


7 


CF1012 


8 


CF1013 


9 


SC1000 


10 


SC1001 


11 


SC1002 


12 


SC1003 


13 


SC1008 


14 


SC1009 


15 


SC1016 


16 


SC1017 


17 


SC1018 


18 


SC1019 


19 


SC1021 


20 


SC1022 


21 


SC1024 


22 



GGTGCTTTATGACTCTGCCGC 

GCTGAAGTGGTGGAAACCGC 

GCTCTTTCCGCTCTGCCATC 

CGGCACTGGCAAGCAACTGA 

CCTCAACCGGATCGAAGGCT 

AGCGTGACGGTCACACCGTT 

GACTCTGGCCATCTGCTCGT 

GGACCTATCTGCCCGTTCGT 

GCCACCAGTCATCCTCACGA 

GCAGCGTGATTTCACGGTCG 

GCTCACATAACGTCCACGCAG 

GCCTCGCATATCAGGAAGCAC 

GGGTGACX3ATGTGATTTCGCC 

GGCATTCCTACXjAGCAGATGGT 

GGTCTGCCTGATGCTCCACT 

GTCGGACTTGTGCAAGTTGCC 

GCATGGATTCTGTCGACCCAC 

GAGAACCACCGAGCCTGATG 

AGCATTGGCCGTAAGTGCGATT 

GGCCTTGITGATCGCGCTTTGA 

TGTCACGCCTGCCTGTTGCTT 

GCGTTCCGCACGAGATACATG 



Ml 



304-324 


67 


506-525 


67 


* 1841-1860 


66 


* 4921-4940 


67 


* 6569-6588 


67 


* 9741-9760 


70 


* 10600-10619 


65 


* 12981-13000 


67 


* 14551-14570 


65 


* 17025-17044 


69 


* 19259-19279 


67 


* 21359-21379 


66 


* 23335-23355 


67 


* 26893-26914 


66 


* 28536-28555 


64 


* 30436-30456 


67 


* 32741-32761 


65 


* 34413-34432 


64 


* 35454-35475 


69 


* 38118-38139 


70 


* 39505-39525 


68 


* 42730-42750 


68 



30 



35 



Lambda vector primers, from the J and cro gene sites of phage lambda: 



CF1018 
CF1019 



23 AGAAACAGGCGCTGGGCATC 18872-18891 67 

24 CGGGAAGGGCTTTACCTCTTC * 38197-38217 66 



40 



45 



Primers for human beta-globin gene cluster (accession No. J00179): 



RH1019 


25 


CTGCTGAAAGAGATGCGGTGG 


54529-54549 


65 


RH1020 


26 


CTGCAGTCCCAGCTATTCAGG 


52152-52172 


63 


RH1022 


27 


CGAGTAAGAGACCATTGTGGCAG 


48528-48550 


65 


RH1024 


28 


TTGAGACGCATGAGACGTGCAG 


44348-44369 


67 


RH1025 


29 


CCTCAGCCTCAGAATTTGGCAC 


42389-42410 


65 


RH1026 


30 


GAGGACTAACTGGGCTGAGACC 


40051^0072 


65 


RH1016 


31 


CAGCTCACTCAGTGTGGCAAAG 


* 62589-62610 


64 


RH1053 


32 


GCACTGGCTTAGGAGTTGGACT 


* 61986-62007 


65 


* Downstream primer complementary to position numbers listed 





50 



Primers were synthesized using the cyanoethoxyphosphophoramidite method (1 uM scale) on a 394 
DNA Synthesizer (Applied Biosystems, Foster City, CA). The primers were deprotected and cleaved from 
the resin in 29% NH 3 /H 2 0, then desalted with Sephadex G25 (NAP-10 columns from Pharmacia LKB, 
Piscataway, NJ). The results of each synthesis were assessed by poly aery lam ide gel electrophoresis. All 
primer stocks w re made with 10 mM Tris^CI (pH 8 at 25 *C), 0.1 mM EDTA. 
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Thermostable DNA Polymerases 

.Recombinant Tth DNA polymerase (rTth) was purchased from Perkin Elmer, Norwalk, CT The 77/ DNA 
polymerase .s descr.bed in U.S. Patent No. 5,2 10 ,036. The Tli DNA po.ymerase (Vent R ® ) and DNA 

BeveTrMA ZtZThT f eCieS GB -° V6ntR@) Wer& P"-has P ed y from New England Biolabs 

Beverly MA^ The Tma DNA polymerase ,s described in International Patent Publication No. WO 92/03556 
and referred to therein as P 7Vna12-3. A modified DNA po.ymerase from Thermatoga maritima^s 
commercially available from Perkin Elmer, Norwalk, CT (UlTma™) mermawga mantima .s 

Dilutions (1/5 and 1/10) of the Vent R ® and Deep Vent R ® DNA polymerases preferably may be made in 
storage buffers as described by each manufacturer. In the Examples below, however, the Vent R ® dHution 

01 /o Triton X-100. Th.s mod.ficat.on had no effect on the amplification reactions. Vent R ® polymerase 
d,lut.ons were made fresh weekly; Deep Vent R ® po.ymerase. was diluted just before use The po ymerase 

20mM O T hpTJ^Y™ DNA buffer supp.ied by the M aZe^lTiS 

20 mM Tns-HCI, P H 8.0, 0.1 mM. EDTA, 1 mM DTT, 0.5% Tween® 20, 50% (v/v) glycerol). 

Additional Buffer Components 

*r-?l at nZt ^ P0lymerase buffer ( 5% W Glycerol, 10 mM Tris-CI (pH 8.3), 100 mM KCI 0 75 mM 
EGTA. a05% Tween 20) for PGR was obtained from Perkin Elmer, No^alk, CT -Tricine bufeY LTta 
(S-gma Chemicals, St. Louis, MO) at 1 .0 M were adjusted to their final P H (at 25 • C) w^m KO^ Molecular 

^^^^IrT^T^ 9lyCer0 ' W6re fr0m Si 9- Chemicals, St Louis MO a " t 
Bake clZ S !' ^ ,,lipSbu t r9 / NJ ' res P^«^'y- Potassium acetate (KOAc) was a.so obtained from J T 

.nc udeSTh^V , C ° ntnbut,0n 0f 9 , y cerol ***** * from enzyme storage buffers was not 
mcluded .n the glycerol concentrat.ons given for any PCR buffer described herein. 

PCR Methods . 

All lambda genomic DNA amplifications were performed in a GeneAmp® PCR System 9600 thermal 
cycler, us.ng MicroAmp™ tubes with individual caps, all marketed by Perkin Elmer UoZaTk CT ReacZ 
voumes were e.ther 50 or 100 ul. The concentration of each dNTP Was 0.2 mM fo Sf3oS"5?S2 
react.cn components were varied as discussed in the text and listed in Table 1 react '° n s. but other 

»h^^'« im ' 2e the amP ' ificati0n ° f n °n-specific sequences and the formation of primer-dimers manual 

thelXcleTS E^c fa!" PH 0 " ^ urt " the Samp ' eS had been incubated'n the 

™™ 1 ♦ ! ~ 90 seconds " The necessary Mg*+ was then added from a 25 mM stock (at 

ZZT^l 0 f:7 ng T add ?° n Q ° f MQ2+ - ^ SamP ' eS W6re inCubated for an addTtiona 30-60 
th ^ ,, 2S «T°< P "V° ^ firSt denaturati °" «•*»■ The total time indudes 
"hnt cT=fr+» h t 9 ' and therefore de Pends upon the total number of tubes. An alternate 
hot-start procedure is described in Example 6. alternate 

The thermal cycler was programmed to carry out a two-step temperature profile Each amolification 

2u C min C ute S s S A 1? ^h^" * 94 '° * 1 ° seconds ™°- d by .annealing and' eZSo at 68'C Tt 
^mum A 15 second denaturation step can also be. used. For annealing and extension times longer 
^ViZ^'^J^TV^^ of the thermal cycler was used to add 15-20 seconds pe 
thP . UteS - Reactlons were carried out for between 25 and 40 cycles, depending upon 

In intS 10 C0PV nUmb9r ' the tar96t ,en9th ' and the ' eaction c ^Zn S . In most reacts 

Th? 1 ,t ♦ "h 3 " 0 " SteP 31 94 ° C arid 3 final 10 minute '"cubation step at 72 -C were included 
F.X ^ andT ; th SS r bed the f0 " 0Win9 eXampl8S ' 0f human 9 enomic Coned into"ambda 

Z with ?h« T ? th l hUman beta -9' obin *™ ch-tor were carried out essentially as described above 

'^J1^S^^ M below - Specific conditior,s for the amplification of h " man 9- omi ' sss 

cloned in lambda FIX II from plaque suspensions in 1 00 ul reaction volumes were as follows 
25 mM tricine (pH 8.7) 
85 mM KOAc 



12% (w/v) glycerol 
0.2 mM each dNTP 
55 0.4 uM each primer 

1 .75 U Tth polymerase 
0.02 U Tli polymerase 
1.15 mM Mg(OAc) 2 
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An 80 °C hot-start was used with a two-step thermal cycling profile, as described above. The annealing and 
extension step was initially 12 minutes at 68 °C and extended by 15 seconds per cycle for 32 cycles. 

Specific conditions for the amplification of a region of the human beta-globin gene cluster from 37 ng of 
KAS011 DNA in 50 ul reaction volumes were as follows. 
5 20 mM tricine (pH 8.7) 

85 mM KOAc 

10% (w/v) glycerol 

2% (v/v) DMSO 

0.2 mM each dNTP 
w 0.2 liM each primer 

0.9 U Tth polymerase 

0.02 U 77/ polymerase 

1.1 mM Mg(OAc) 2 

A 78°C hot-start was used with a two-step thermal cycling profile, as described above. The annealing and 
15 extension step was initially 12 minutes at 68°C for 12 cycles, then extended 15 seconds per cycle for 24 
cycles. 

Increased yields of amplified product may be obtained by the addition of up to 500 ug/ml of 
nonacetylated BSA to the amplification reaction. 

20 Analysis of PCR Products 

Typically, 5-8 ul from each PCR amplification were analyzed on standard horizontal gels consisting of 
0.6% (w/v) SeaKem GTG agarose (FMC BioProducts, Rockland, ME) in 1X TBE (89 mM Tris base, 89 mM 
boric acid, 1 uM to 2 mM EDTA) or 1X TAE (40 mM Tris-acetate, 2 mM EDTA, pH 8-8.5) with 0.5 ug/ml 
25 ethidium bromide, at about 4-6 V/cm for 1.5-2 hours. For greater size resolution, two alternatives were used: 
field inversion gel electrophoresis and 0.3% agarose gel electrophoresis. 

Field inversion gel electrophoresis (FIGE) was performed using a Hoefer system (SuperSub gel 
apparatus, Switchback pulse controller, and power supply, all from Hoefer, San Francisco, CA) with a 
cooling unit (2219 Multitemp II from Pharmacia LKB). Between 3 and 7 ul from each PCR amplification 
30 were analyzed on FIGE gels of 0.95% agarose in 0.5x TBE (at 1 uM EDTA). The FIGE gels were prerun for 
15. minutes at 110 V, then run for 22-25 hours at 140-145 V, with pulse times of 0.65-1.95 or 0.75-2 seconds 
(forward:reverse = 2.8:1 or 3:1). Run temperatures were estimated at 12-15 °C. 

Alternatively, load 2-5 ul on 0.3%. Chromosomal Grade agarose (Bio-Rad, Richmond, CA) or Seakem 
GTG or Gold (FMC BioProducts, Rockland, ME) in 1X TAE. Cool the gel to 4°C before removing the comb. 
35 Load 5-8 ul of sample and run in 1X TAE with 0.5% ethidium bromide at 100 V for 2 minutes, then either at 
1 .5 V/cm for 6 hours or at 0.7 V/cm for 1 6 hours. 

The size of the amplified products was determined by comparison with molecular weight markers run 
on each gel in addition to the sample. Molecular weight markers used were lambda/W/ndlll from either New 
England Biolabs or Gibco BRL, lambda/mono cut mix from New England Biolabs, and 1-kb ladder from 
40 Gibco BRL 

For restriction analyses, aliquots (10-16 ul) of PCR amplification product from lambda DNA amplifica- . 
tions were digested with fic/l, BssHII, and Mlu\ (New England Biolabs); or fiamHI, EcoRI, and tf/ndlll (Gibco 
BRL) f using the manufacturer's buffers, prior to electrophoresis. Digestions were carried out for 2.5-3 hours 
in 30-36 ul reactions. Samples were analyzed using 0.6-8% agarose gels. Aliquots of plaque PCR samples 
45 (10-30 ul aliquots) were digested with Not\ (Stratagene) overnight in 40 ul reactions. 

Example 2 

Amplification of Phage Lambda Genomic Sequences 

50 

Amplifications were carried out using target sequences from high copy (lOMO 8 copies of target) phage 
lambda DNA samples as described in Example 1, above. Targets of 1.5 to 42.2 kb were defined within this 
~50-kb sequence (GenBank M17233) by the various pairings of the primers listed in Table 2, above. 

Amplified product was analyzed by field inversion gel electrophoresis (FIGE) and visualized with 
55 ethidium bromide staining. Total yields (per 50 ul), as estimated by comparison with a lambda/H/ndlll 
molecular weight marker, were estimated at between 0.7-1 ug of 22.8 kb product and 0.2-0.3 ug of 39-kb 
product. A 42.2 kb target, amplified using primers SC1011 (SEQ ID NO: 2) and SC1024 (SEQ ID NO: 22), 
was amplified with lower yields. 
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Example 3 

Amplification of Lambda Clones From Plaques 

c>onerw!Z°? ant US ! melh ° dS ° f the pr6Sent invention is the amplification of inserts from lambda 

T^hTI >h Pr,0r ' 1 , ,ab0r and time intensiv « DNA isolations. To demonstrate the utility of the present 
methods to the amplification of such inserts, primers CF1018 (SEQ ID NO: 23) and CF1019 (SEQ ID NO 
24) were des.gned from sequences within the J and era genes of lambda (see Table 2) 

Ampl,f,cat.ons were carried out as described in Example 1 using randomly selected olaaues from th» 

a72rTT C m Tl in ' ambda RX ■" d6SCribed in Example A"p.ification y p roduct^ analyzed by 
gel electrophoresis following digestion with Notl to separate the insert from flanking vector sequences The 
presence of both vector fragments confirms that the entire insert was amplified ^nces. The 

estii^s^nslt^ J «5fi ran9ed fr0m ISSS th3n 10 kb t0 9rCater ,han 20 kb " The manufacturer 
estimates hat insert sizes of 9-23 kb are accommodated by this lambda vector. Inserts were sized bv their 
mobility relative to molecular weight markers in FIGE gels. V 

Example 4 

Amplifications of Human Genomic Targets 

The human beta-globin gene cluster was chosen as a model for.genomic targets that are likelv to 
contain repetitive sequences and homologous sites elsewhere in the genome Prime s designed E ^the 
human beta-globin gene cluster are shown in Tab,e 2. above. A fixed downstream S^K^t^ 
2S ZZr TT m fT erS ^ 3mPlify 3 re9i ° n extendin 9 u P stream across the delta-globin gene and nto 
?7 n n n I ■ , f 9amma 9 ' 0bin 9ene " Tar96tS ° f 13 - 5 ' 17 - 7 - 19 6 - and 2 2 ^ were ampHfied ' 1 
fLlffi 1 ST* ° f t0tal hUman 9 enom ' c DNA as described in Example 1. A.iquots of 12 5 u! of The 
comparison! ^ ^ °° RGE 9 ^ \ ,ambda/ »" d '» molecular weight' marker was uled for 

30 05 nn f°iT riSOn ' TT S ° f 16 - 5 ' 188 ' 20 ' 8 ' and 22 8 kb were amplified from 0.05 pg. (-i 0 3 copies) or 
30 0.5 pg (-10* copies) phage lambda DNA in a backqround of -3 7 no nr nn «LL , J 
Placenta, genomic DNA, under the same conditions. ^^^^^'.^^JS^ 
a ge prev.ous.y amplified from a high input target number, the effects attributab.e toa"e«eL7™M 
target copy number can be separated from the effects attributable to a difference in target seance 

t ar nZt 9 I ", P * 22 "° ' en9th ° f the b * a -°l°bin gene cluster were amplLd The Sa-dlobin 

targets were amplified less efficiently than lambda sequences of similar length that were at siSeSov ' 

or at a" lO^ofdT " * mn P ' a . Cental DNA ' eith - * *» -me overal. concentration's the 8 Sn target 
r™, \ T concentrat,on - Th ese efficiency differences may reflect the relative sequence 

SSh^ ! h ° U9h ^ ' ambda tar96t W3S a,S0 in a huma " 0*L*6 background I The TncrSsed 
22 L Th 9 tar96tS Wi " C ° ntain Sit6S SUfficient,y h^o'ogous to act as secondary primer annSnq 
sites and the presence of repetitive sequences in human genomic sequences may exp£ Twhv lambda 
targets were more efficiently amplified then beta-globin genetergets of comparable .Sgth " 

- tion of hPtTS ° f * SeC °" dary Primin9 sites a 'so affected the choice of suitable primers'for the amplifies- - 
tion of beta-globin gene targets. Downstream primer RH1053 (SEQ .D NO: 32), which hybrid zes ™ to toe 
be a-g obin gene, was chosen because RH1016 (SEQ ID NO: 31), which hybMira vXT^ 2 of K 
beta-globm gene, also hybridizes to a secondary sites within targets longer tha 1 ^ kb Tsu.tiTq in mlote 
thT ule nf ^ U r eam Primer RH1G2 ° (SEQ ' D N0: 26 > resul ^ d "•laSjHSSi ^£ 

Tn rrepersCner 6 " ^ Sh ° Wn) ^ 1 °° b ~ 8 '° f RH102 ° <^ ID NO: 26).^^^ 
n^T" 3 fr ° m amplifications of sequences up to 16 kb in length from the human neurofibromatosis-1 
KSSSr meth ° dS t0 inSUre Primer specificit V are crucia, to efficient P^XSSTof ' 



Example 5 ' 
55 DNA Polymerase Combinations 



carriln ^VT^ "? cien( * ° f Vari ° US DNA P o| V m ^se combinations, amplification reactions were 

earned out essent.ally as described in Example 1, above, using primers which, amplify tor^^iSS- 
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22.8, 26.4, 29.9, and 33.9 kb in length. The DNA polymerase combinations compared were as follows: 

2.5 U rTth DNA polymerase + 0.02 U Vent R © DNA Polymerase 

2.5 U rTth DNA polymerase + 0.06 U Deep Vent R ® DNA Polymerase 

3.15 U rTth DNA polymerase + 0.5 U Tma DNA Polymerase 
5 All reactions were carried out in 50 ul, with 10 7 copies of lambda DNA, 0.45 uM each primer and 1.0- 

1.1 mM Mg(OAc)2- Amplification reactions using the following specific conditions. 

Reactions using either rTth and Vent R ® or rTth and Deep Vent R ® DNA polymerases were carried out 
in 20 mM tricine (pH 8.7), 85 mM KOAc, 10% glycerol, and 3% DMSO. Reactions using rTth and Tma 
DNA polymerases were carried out in 20 mM tricine (pH 8.7), 85 mM KOAc, 10% glycerol, and 2.5% 
70 DMSO. 

The temperature cycling profile was essentially as described in Example 1, above. An initial 13-minute 
extension time was used for the first 9 cycles. The extension time was then increased to 13.5 minutes and 
increased 20 seconds in each subsequent cycle for 18 cycles. Seven ul aliquots of each reaction were 
loaded on a standard agarose gel along with 150 ng of the lambda/H/ndlll molecular weight marker. 
75 All templates (to 33.9 kb) were amplified using combinations of rTth DNA polymerase with Vent R ®, 
Deep Vent R ®, and Tma DNA polymerases. The combination of 2.5 U rTth DNA polymerase and 0.02 U 
Vent R ® DNA Polymerase amplified all targets with the greatest efficiency. 

Example 6 

20 ' 
PCR Amplification Kit 

The reagents of the invention are suitable for inclusion in a kit for carrying out the PCR amplification of 
long target sequences. A kit contains at least a DNA polymerases mixture as described herein. Additional, 
25 optional, components include additional reagents and reaction containers used in the reactions as described 
below. 

A preferred combination of DNA polymerases useful for amplifying both high copy and low copy targets 
consists of rTth and Vent R ® DNA polymerases in a ratio of 2 units of rTth DNA polymerase to 0.08 units of 
Vent R ® DNA polymerase. Although, as shown below, the preferred polymerase concentration for the 
30 amplification of high copy targets is twice the preferred concentration for the amplification of low copy 
targets, the ratio of primary to secondary polymerases is the same. 

A reaction buffer suitable for inclusion in a kit consists of tricine, KOAc, glycerol, and DMSO in about 
the following concentrations: . 

25 mM tricine (pH 8.7) 
35 80 mM KOAc 

10% (w/v) glycerol 

2.25% (v/v) DMSO 

The term "about" is meant to encompass a standard plus or minus 10% manufacturing tolerance. For 
convenience, the reaction buffer may be stored at a higher concentration and diluted before using. 
40 Amplifications are carried out using the preferred kit components essentially as described above, but 
using the preferred reaction conditions described below. These reagents and conditions have been used 
extensively and have been found to provide reliable amplification of long target sequences. 

Preferred conditions for the amplification of low copy (e.g. human genomic) targets (2.0 x 10 4 copies) in 
100 ul reaction volumes are as follows. 
45 25 mM tricine (pH 8.7) 
80 mM KOAc 
10% (w/v) glycerol 
2.25% (v/v) DMSO 
0.2 mM each dNTP 
so 0.2 uM each primer 
2 U rTth polymerase 
0.08 U Vent R ® polymerase 
1.1 mM Mg(OAc) 2 

Preferred cycling parameters for the amplification of low copy targets (>10 kb) are as follows: 
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25 
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Denaturation 


94'C 


1 minute ' 


20 Cycles 


94 -C 


15 seconds 




68 : C 


12 minutes 


17 Cycles 


94 'C 


1 5 seconds 




68 'C 


12 minutes with 15 second autoextend 


Final Extend 


72-C 


10 minutes 


Hold 


4 * C 


indefinite . 



25 mM tricine (pH 8.7) 

80 mM KOAc ' ' - , 

\ 10% (w/v) glycerol . 

2.25% (v/v) DMSO • ' 

0.2 mM each dNTP - 
0.4 jllM each primer 

4 U rTth polymerase • 
0.16 U Vent R ® polymerase ' 
1.1 mM Mg(OAc) 2 . 

Preferred cycling parameters for the amplification of high copy targets (>,0 Kb) are as follows: 



Denaturation 


94 *C 


1 minute \ •'. 


16 Cycles 


94 °C 


15 seconds 




68 -C 


10 minutes 


12 Cycles 


94* C 


15 seconds 




68 *C 


10 minutes with 15 second autoextend 


Final Extend , 


72 * C 


10 minutes 


Hold 


4*C 


indefinite 
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100 w^aTs, ^SS^T Jo*" 1 th6 r aC R ti0n tUbeS Ampliwax™ PCR Gem 

Norwalk, CT. A 40 u. bottom ^0™^^ 3nd marketed b * P ™"» Bmer, 

2. and the dNTP's is ad^Tto thf rSn ? 9 t (tr,C,ne ' K ° AC ' 9lyCero1 ' and DMS0 >' M 9(OAc)- 

an Am P ,,W» PCR 0^^^ by adding 

25 -C for 5 minutes. A 60 ul too reaoent Lor iHhTn o7h ! C f ° r 5 minutes - and then at 

mixture, the primers, and the tergeToNA " * ^ ^ C ° nta,nin 9 buffer - the Dl ™ polymerase 

hour?aT P 7' Wcm" ana ' y2ed " ^ ° n A 9 arose * in IX TAE and 0.5 ug/m, EtBr for 1.5 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 

<i) APPLICANT: 

(A) NAME: F.Hoffmann-La Roche AG 

(B) STREET: Grenzacherst rasse 124 

(C) CITY: Basel 

(D) STATE: BS 

10 (E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : CH-4002 

(G) TELEPHONE: (0)61 688 24 03 

(H) TELEFAX: (0)61 688 13 95 

(I) TELEX: 9622 92& 965542 hlr ch 

75 (ii) TITLE OF INVENTION: Amplification of Long Nucleic Acid Sequences 

by PCR 

(iii) NUMBER OF SEQUENCES: 32 

<iv) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible . 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) . 

25 (2) INFORMATION FOR _SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
.(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: - 
GGTGCTTTAT GACTCTGCCG C 21 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
40 GCTGAAGTGG TGGAAACCGC 20 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
GCTCTTTCCG CTCTGCCATC 20 

50 
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(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS : single 
. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
_ {xx) SEQUENCE DESCRIPTION : SEQ ID NO 
CGGCACTGGC AAGCAACTGA 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
75 < c > STRANDEDNESS: single 

(D) TOPOLOGY: linear 
Ui) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 
CCTCAACCGG ATCGAAGGCT 

20 (2) INFORMATION FOR SEQ ID NO: 6- 

(i) SEQUENCE CHARACTERISTICS- 
- (A) LENGTH: -20 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(if) MOLECULE TYPE: DNA (genomic) 

-n^iJ 1 * SEQUENCE DESCRIPTION: SEQ ID NO • 
AGCGTGACGG TCACACCGTT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs , 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
Ui) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO • 1 

GACTCTGGCC ATCTGCTCGT * 

(2) INFORMATION FOR SEQ ID NO: 8 • 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:, linear . - . 
( ii ) MOLECULE TYPE : DNA * (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO * 8 
GGACCTATCT GCCCGTTCGT 

45 - 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
50 <C> STRANDEDNESS: single 

<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 9 : 
GCCACCAGTC ATCCTCACGA 20 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20. base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCAGCGTGAT TTCACGGTCG 20 

(2) INFORMATION FOR SEQ ID NO: 11: 
75 (i) SEQUENCE CHARACTERISTICS: < 

(A) LENGTH: 21 base pairs * 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

G C TC AC AT AA CGTCCACGCA G 21 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
30 GCCTCGCATA TCAGGAAGCA C 21 

(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear _ 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGGTGACGAT GTGATTTCGC C 21 

40 (2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGCATTCCTA CGAGCAGATG GT 22 
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(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 
■■.(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
Ui) MOLECULE TYPE: DNA (genomic) 
v^o^i- 1 * SEQUENCE DESCRIPTION : SEQ ID NO -15 
GGTCTG.CCTG ATGCTCCACT ■ ' " 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 

™^i XX) SE Q UEN CE DESCRIPTION: SEQ ID NO- 16 
GTCGGACTTG TGCAAGTTGC C b 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
20 < A > LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS.: single ■ 

(D) TOPOLOGY: linear 

(if) MOLECULE TYPE: DNA (genomic) 
^< Xl) SEQUENCE DESCRIPTION: SEQ ID NO'17- 
25 GCATGGATTC TGTCGACCCA C wu.l/. 

(2) INFORMATION FOR SEQ ID NO:18- 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
..(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
Ui) MOLECULE TYPE: DNA (genomic) ' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO* 18- 

GAGAACCACC GAGCCTGATG U U ' 18, 



30 
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(2) INFORMATION FOR SEQ ID NO: 19: 



(i) 



SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single • 
40 (D) TOPOLOGY: linear 

(if) MOLECULE TYPE: DNA (genomic) 
(X1) SEQUENCE DESCRIPTION: SEQ ID NO-19- 
AGCATTGGCC/GTAAGTGCGA TT ; _ ' V i^ NU.iy. , 

(2) INFORMATION FOR SEQ ID NO- 20 • 
45 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

so , (1 f| MOLECULE TYPE: DNA (genomic) 

n^i XX) SEQUENCE DESCRIPTION: SEQ ID NO-20- 
GGCCTTGTTG ATCGCGCTTT GA U ' 
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(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
5 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
TGTCACGCCT GCCTGTTGCT T 21 

10 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
75 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GCGTTCCGCA CGAGATACAT G 21 
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(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGAAACAGGC GCTGGGCATC 20 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGGGAAGGGC TTTACCTCTT C 21 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CTGCTGAAAG AGATGCGGTG G 21 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: " 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

■■n^i^l SEQUENCE DESCRIPTION: SEQ ID NO-26 
CTGCAGTCCC AGCTATTCAG G 

(2) INFORMATION FOR SEQ ID- NO-27- 
(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH : 23 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

l^l o^f CULE TYPE: DNA <genom±c) 
' < X±) SEQUENCE DESCRIPTION: SEQ ID NO-27- 
CGAGTAAGAG ACCATTGTGG CAG U '' * 

(2) INFORMATION FOR SEQ ID NO:28' 
(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
™^*i Xl) SEQUENCE DESCRIPTION: SEQ ID NO-28" 
TTGAGACGCA. TGAGACGTGC AG ' * 

(2) INFORMATION FOR SEQ ID NO:2 9* 
(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
' ,™ (X1) SEQUENCE DESCRIPTION: SEQ ID NO'29- 
CCTCAGCCTC AGAATTTGGC AC W.xuwu.zy. 

(2) INFORMATION FOR SEQ ID NO: 30- 
(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ' 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
V^^i*^ S£ QUENCE DESCRIPTION: SEQ ID NO- 30- 
GAGGACTAAC TGGGCTGAGA CC UJU ' 

(2) INFORMATION FOR SEQ ID NO:31- 
(i) SEQUENCE CHARACTERISTIC^: 

-(A) LENGTH : 22 base pairs, * , 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

, (ii) MOLECULE TYPE: DNA (genomic) 
r*^ UX) SEQUENCE DESCRIPTION: SEQ ID NO-31- 
CAGCTCACTC AGTGTGGCAA AG Ji " 

(2) INFORMATION FOR SEQ ID NO: 32- 
(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCACTGGCTT AGGAGTTGGA CT 22 



10 

Claims 

1. A DNA polymerase composition for the polymerase chain reaction amplification of long nucleic acid 
sequences consisting of a combination of a first DNA polymerase and a lesser quantity, measured in 
75 units of polymerase activity, of a second DNA polymerase, wherein said first DNA polymerase is 

Thermits thermophitus DNA polymerase, and wherein said second DNA polymerase is selected from 
the group of DNA polymerases consisting of Thermococcus fitoralis DNA polymerase, Pyrococcus 
species GB-D DNA polymerase, and Thermotoga maritima DNA polymerase. 

20 2. The DNA polymerase composition of Claim 1, wherein said second DNA polymerase is Thermococcus 
litorah's DNA polymerase. 

3. The DNA polymerase composition of Claim 2, wherein said DNA polymerase composition consists of 
about 0.8-2.5 units of first DNA polymerase for each 0.015-0.15 units of second DNA polymerase. 

25 

4. The DNA polymerase composition of Claim 2, wherein said DNA polymerase composition consists of 
about 2 units of first DNA polymerase for each 0.08 units of second DNA polymerase. 

5. A reaction buffer for the polymerase chain reaction amplification of long nucleic acid sequences 
30 comprising about 25 mM tricine, 80 mM KOAc, 10% (w/v) glycerol, and 2.25% (v/v) DMSO. 

. 6. A process for arnpliying a nucleic acid using a DNA polymerase composition as claimed in any one of 
Claims 1 to 4. 

35 7. Use of a DNA polymerase composition as claimed is any one of Claims 1 to 4 for amplifying a nucleic 
acid. 

8. A kit comprising a DNA polymerase composition as claimed in any one of Claims 1 to 4. 

40 9. A kit of Claim 8 further comprising a reaction buffer for the polymerase chain reaction amplification of 
long nucleic acid sequences, wherein said reaction buffer preferably comprises about 25 mM tricine, 80 
mM KOAc, 10% (w/v) glycerol, and 2.25% (v/v) DMSO. 

10. The invention as hereinbefore described. 

45 
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